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ABSTRACT 

We reconstruct the projected mass distribution of a massive merging Hubble Frontier 
Fields cluster MACSJ0416 using the genetic algorithm based free-form technique called 
Grale. The reconstructions are constrained by 149 lensed images identified by Jauzac et 
al. using HFF data. No information about cluster galaxies or light is used, which makes 
our reconstruction unique in this regard. Using visual inspection of the maps, as well 
as galaxy-mass correlation functions we conclude that overall light does follow mass. 
Furthermore, the fact that brighter galaxies are more strongly clustered with mass 
is an important confirmation of the standard biasing scenario in galaxy clusters. On 
the smallest scales, approximately less than a few arcseconds, the resolution afforded 
by 149 images is still not sufficient to confirm or rule out galaxy-mass offsets of the 
kind observed in ACO 3827. We also compare the mass maps of MACSJ0416 obtained 
by three different groups: Grale, and two parametric Lenstool reconstructions from 
the CATS and Sharon/Johnson teams. Overall, the three agree well; one interesting 
discrepancy between Grale and Lenstool galaxy-mass correlation functions occurs on 
scales of tens of kpc and may suggest that cluster galaxies are more biased tracers of 
mass than parametric methods generally assume. 

Key words: gravitational lensing: strong, galaxies: clusters: individual: MACS 
J0416.1+2403 


1 INTRODUCTION 

The phenomenon of bending of light due to the interven¬ 
ing matter, referred to as gravitational lensing, provides 
an opportunity to use massive clusters of galaxies as nat¬ 
ural telescopes (Fort & Mellier 1994; Kneib & Natarajan 
2011), and hence presents a unique tool to examine far 
away galaxies in detail. According to the standard ACDM 
model of cosmology, the first structures in the Universe form 
due to the gravitational instability of the dark matter and 
baryons which collapse under their own gravity (Albada 
1961; Peebles 1970). Later, massive galaxies and galaxy clus¬ 
ters formed by a series of mergers of the smaller systems 
(Peebles 1983; Blumenthal et al. 1984). In this hierarchical 
structure formation scenario there exists a large population 
of low and intermediate mass galaxies at higher redshifts. If 
these happen to sit behind a massive galaxy cluster, their 
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light will be bent, and their images distorted and magni¬ 
fied according to Einstein’s Theory of General Relativity. 
This magnification, which can be significant, is the motiva¬ 
tion behind the Hubble Frontier Fields (Bullock et al. 2012) 
program (hereafter HFF). The Space Telescope Science In¬ 
stitute selected six galaxy clusters, and associated parallel 
fields to gain a magnified view of the high redshift galax¬ 
ies behind them, and to investigate complex galaxy clusters 
doing the lensing. 

To use clusters as telescopes one must have a detailed 
and accurate characterization of their optics, or maps of 
the mass distribution. The present work is concerned with 
the mass distribution of one of the HFF clusters, MACS- 
J0416.1-2403 (hereafter MACSJ0416). It is a massive grav¬ 
itational lens at a redshift of 0.396, and right ascension 
04:16:09, and declination -24:03:58, (Mann & Ebeling 2012). 
The cluster shows features of a recent major merger or 
pre-merger (Jauzac et al. 2015; Ogrean et al. 2015), like 
double-peaked X-ray surface brightness (Mann & Ebeling 
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2012), elongation, and many sub-structures (Zitrin et al. 
2013; Grillo et al. 2015). It has been studied in detail for 
its mass distribution (Zitrin et al. 2013; Jauzac et al. 2014, 
2015; Grillo et al. 2015; Diego et al. 2015), magnification 
maps (Johnson et al. 2014; Richard et al. 2014), and mass 
power spectrum (Mohammed et al. 2016) using different in¬ 
version methods. 

The goal of the lens reconstruction techniques is to 
build mass models of the lens constrained by the gravi¬ 
tational lensing data, which, for strong lensing consists of 
identifications, positions and redshifts of multiple images of 
background sources. There are a few different lens inver¬ 
sion methods in existence, based on one of the two differ¬ 
ent approaches to lens reconstruction - parametric and non- 
parametric, or free-form. Some methods are hybrid (Diego et 
al. 2015), and incorporate features of both approaches. Para¬ 
metric methods assume an underlying functional form of the 
mass distribution of the lens, and the lensing data is used to 
constrain the parameters of the functional form. The number 
of parameters is usually small, and statistical inference can 
be made using regular sampling methods, like Markov-Chain 
Monte-Carlo (MCMC). Though in principle these physical 
models can take any form, in practice parametric models 
assign mass to cluster galaxies, assuming light-traces-mass, 
and add a few additional mass components to represent the 
cluster dark matter. One widely used example of such tech¬ 
niques is Lenstool (Julio et al. 2007). On the other hand, 
non-parametric models (or free-form) solve for the lensing 
mass by using the images data alone, with no reference to 
cluster galaxies. Some free-form methods match the num¬ 
ber of free parameters to the number of lensing observables, 
while others work with a parameter space whose dimension¬ 
ality greatly exceeds the number of constraints. An example 
of the latter type of method is Grale (Liesenborgs et al. 2006, 
2007, 2008, 2009) which uses no light information from the 
lens, and requires only the position and redshifts of the mul¬ 
tiple images of the background sources. In this paper we use 
Grale to construct mass models of MACSJ0416. 

Our paper has two goals: (i) to study the mass distri¬ 
bution of the cluster with respect to the light, and (ii) to 
compare results of two very different methodologies: free¬ 
form Grale with n par am n CO nstr, and light-traces-mass 
Lenstool, with nparam^nconatr- To accomplish the first 
goal in an unbiased way one needs a method that does not 
include cluster galaxies as input; Grale satisfies that crite¬ 
rion. One motivation for this is the recent discovery of an 
offset between a galaxy deep inside the gravitational poten¬ 
tial of a massive cluster, ACO 3827 and its dark matter halo 
(Williams & Saha 2011; Mohammed et al. 2014; Massey et 
al. 2015). Because our analysis is confined to the central 
portions of the cluster we do not expect to find mass-light 
discrepancies of the kind found by Merten et al. (2011) and 
Jauzac et al. (2015). Our second goal has been addressed by 
some recent papers (Zitrin et al. 2010; Coe et al. 2012), but 
usually these compare circularly averaged density profiles, 
which is probably not the most suitable statistic in this case 
as MACSJ0416 underwent a recent merger. We use a more 
detailed metric, namely mass-galaxy correlation function. 

In Section 2 we give a brief summary of the existing 
mass reconstructions of this cluster. In Section 3 we discuss 
our lens reconstruction method. Section 4 presents our re¬ 
sults, and in particular discusses light and mass offsets and 


galaxy-mass correlation function obtained with Grale, and 
two inversions with Lenstool. In Section 5 we summarize 
our findings. We use flat ACDM cosmology, with = 0.27, 
and h = 0.71. 


2 EXISTING MASS MODELS OF MACSJ0416 

Current literature on mass reconstructions of MACSJ0416 
have used free-form, hybrid and parametric methods to cal¬ 
culate the associated mass distribution. Zitrin et al. (2013) 
used two different parametric methods to determine the 
mass distribution of MACSJ0416. Their first method as¬ 
signed a PIEMD (pseudo-isothermal elliptical mass distri¬ 
bution) mass profile to every galaxy and smoothed the total 
resulting mass map with an elliptical Gaussian to obtain a 
cluster-wide dark matter distribution. The latter was added 
to the mass due to individual galaxies. The second method 
used the same PIEMD model for galaxies, and two elliptical 
NFW halos to describe the dark matter. The authors ran 
an MCMC algorithm to obtain the solution for both meth¬ 
ods. The large number of multiple images in this cluster was 
attributed to the extreme elongation of MACSJ0416. With 
their analysis the authors discovered 70 new multiple images 
from 23 sources. 

Johnson et al. (2014) modeled all 6 HFF clusters using 
pre-HFF data, and analyzed the resulting mass and magni¬ 
fication properties of the reconstructions. They followed the 
parametric approach by utilizing Lenstool to find the best 
solution for the mass distribution in the clusters. They as¬ 
signed a PIEMD to each cluster member, and to two cluster- 
scale components. The best solution was found through an 
iterative process of running an MCMC algorithm at each 
step. The final model was computed under image plane opti¬ 
mization, meaning the differences in the observed and model 
predicted positions were optimized in the lens plane. The au¬ 
thors report a smaller mass inside the z = 2 critical curve 
than Zitrin et al. (2013) because of different redshifts used 
in each model. 

Richard et al. (2014) adopted a strong- and weak- 
lensing approach to find the mass distribution of 
MACSJ0416. They used Lenstool with dual pseudo- 
isothermal elliptical (dPIE) mass profiles for each cluster 
member, and two cluster-wide mass clumps placed in the 
regions of high galaxy number density plus a third halo for 
a lower-redshift galaxy not belonging to the cluster. 

Jauzac et al. (2014) used Lenstool, which assigns a 
dPIE mass profile to cluster galaxies, and cluster wide com¬ 
ponents. Based on the HFF data the authors identify 51 
new multiply imaged systems, for a total of 68, comprising 
red 194 individual lensed images. Their best model produces 
lens plane rms of 0.68 ,/ for 149 images of 57 most securely 
identified sources. 

Grillo et al. (2015) used a parametric method called 
GLEE. They considered a range of ways of parametrizing 
the mass distribution in the cluster, and found that the best 
fit was obtained with a model that has two cored elliptical 
pseudo-isothermal mass distributions to represent the dark 
matter and 175 galaxy-size dual pseudo-isothermal mass dis¬ 
tributions with the mass-to-light scaling with luminosity. 
Their model reproduces the positions of 10 image systems, 
totalling 30 spectroscopically confirmed lensed images very 
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Table 1. Previous MACS0416 mass models with type of data 
used and number of multiple images. 


Author 

Type of Data 

Num. of 

Multiple 

Images 

Zitrin et al. (2013) LTM-Gauss 

pre-HFF 

34 

Zitrin et al. (2013) NFW 

pre-HFF 

34 

Johnson et al. (2014) 

pre-HFF 

34 

Richard et al. (2014) 

pre-HFF 

47 

Jauzac et al. (2014) 

HFF 

149 

Grillo et al. (2015) 

CLASH 

30 

Diego et al. (2015) 

CLASH 

96 


well, with lens plane rms of 0.3”. Their results are largely in 
agreement with previous reconstructions of the shape of the 
mass distribution. A comparison with simulated galaxy clus¬ 
ters with total masses similar to that of MACSJ0416 shows 
that the former contain considerably less mass in subhalos 
in their cores relative to MACSJ0416. 

Diego et al. (2015) used a hybrid approach to recon¬ 
struct the mass distribution of MACSJ0416. Three separate 
mass models were made for the galaxy component of the 
lens, two following the light traces mass assumption and the 
third linking every galaxy to a circularly symmetric NFW 
distribution. The rest of the mass distribution was modeled 
by a free-form method. These mass models showed a bimodal 
mass distribution, similar to the X-ray emission distribution, 
except for small offsets in the two peaks. Collisional effects 
such as dynamical friction are believed to be the reason be¬ 
hind the offsets between X-ray and dark matter distribu¬ 
tions. In addition, a flat mass profile was found on medium 
distance scales surrounding the two peaks of the mass dis¬ 
tribution, likely because of tidal forces, projection effects, or 
possibly self interacting dark matter. The authors find that 
overall light traces mass in MACSJ0416. 

Published mass reconstructions of MACSJ0416 are 
summarized in Table 1. Besides the Zitrin et al. (2013) LTM- 
Gauss method, every previous parametric method used two 
cluster-scale haloes. All previous reconstructions made use 
of the galaxy light in MACSJ0416 in some form when calcu¬ 
lating the mass distribution. We go about the reconstruction 
process differently, and use only the positions of the lensed 
images as input. Thus, light is not assumed to trace mass 
in our reconstruction processes, and our results represent a 
completely independent way of modelling MACSJ0416 and 
testing if light follows mass in clusters. 


3 Grale: FREE-FORM LENS 

RECONSTRUCTION METHOD 

Grale is a free-form technique of lens reconstruction that 
uses a genetic algorithm to calculate the mass distribution 
of a lens. Only the images’ identifications, locations and red- 
shifts are used as inputs for Grale. The images’ positions can 
be modeled either as point sources or as extended regions. 
For this paper we chose to use point sources to model the 
images. To start, a grid is set up with a uniform grid of cells. 
Within each cell a projected Plummer mass distribution is 
placed, fixing the width of each Plummer sphere. Then the 
genetic algorithm searches the parameter space of the Plum¬ 
mer amplitudes and determines the best solution to the lens 


equation and the corresponding mass distribution. A new 
grid is then formed from the original grid by increasing the 
number of cells in areas of high mass density. The genetic 
algorithm then calculates the mass distribution again. This 
process of increasing the density grid cells is repeated several 
times. Through each iteration, reproduction and mutation 
are used to breed new solutions. Thus, Grale obtains the 
final mass distribution (Liesenborgs et al. 2006). 


Grale uses fitness values to pick the best solution for 
the mass distribution. The fitness value of a genetic algo¬ 
rithm evaluates the degree to which the mass map repro¬ 
duces the lensing data, and hence if a particular solution is 
better or worse than others. In general Grale ’s fitness value 
can be based on one, two, or more fitness criteria; for this 
paper we chose to use two. One of the fitness value compo¬ 
nents depends upon image positions and the other uses the 
absence of images. For a given mass distribution, the lens 
equation can be used to project the images from the same 
source back to the source plane. Grale uses the size of the 
region containing all back-projected images as a length scale. 
Thus an absolute scale is not used, because this would fa¬ 
vor solutions that place the images into smaller source plane 
regions, and result in overfocussing. The first component of 
Grale’s fitness value is calculated by measuring the distance 
between all back-projected images of the same source, using 
the aforementioned length scale. The second component is 
based on spurious extra images. In complex lensing systems, 
additional images could exist that trace back to sources. If 
the modeler is sure that these extra images do not exist, 
then the null space of Grale is defined to be the area of no 
images in the image plane. The null space is divided in trian¬ 
gles and each triangle is back-projected to the source plane. 
The second component of Grale’s fitness is the number of 
all the triangles that back-projects to the region containing 
all back-projected images. This paper takes the product of 
the two fitness components to find a single value to compare 
the different iterations of a given run. 


We generated 30 Grale reconstructions, each one using 
nine successive lensing grid refinements. We used 149 secure 
images as outlined in Jauzac et al. (2014) and updated red- 
shifts from Grillo et al. (2015). For each grid the number 
of Plummer distributions was chosen at random from a se¬ 
lected range. The first lensing grid’s range was 300-400 and 
the last, ninth lensing grid’s range was 1700-1800. Each sub¬ 
sequent grid had a greater density of Plummer spheres. In 
addition, the center coordinates and size of each grid was 
allowed to fluctuate by 5” and 10” respectively to eliminate 
the imprint of a fixed grid on the solution. Finally, for each 
run the best mass map was selected depending upon the 
fitness value described above. Each reconstruction’s results 
are somewhat different because the random seed used by 
Grale to initialize a run enables it to explore different re¬ 
gions of the large dimensional model parameter space. To 
reduce the random variations and to enhance the common 
features in mass maps of different runs, the runs—30 in our 
case—are averaged to produce one final mass map, which is 
also a solution, as discussed in Mohammed et al. (2014). 
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4 RESULTS 

The average map of the 30 individual Grale reconstructions 
is shown in Fig. 1; this map is the basis of our analysis. The 
blue lines are the mass density contours and the red circles 
indicate images. There are 20 contour lines linearly spaced 
from 0.03827 g/cm 2 up to 0.76531 g/cm 2 . Using the 30 in¬ 
dividual maps we can calculate the fractional uncertainty 
defined at every location in the lens plane as the ratio of the 
root-mean-square deviation between the maps’ k values and 
the average k of all the maps. (k is the projected surface 
mass density in units of critical density for lensing). The 
fractional uncertainty highlights areas in the mass distribu¬ 
tion that contain small and large uncertainty. As seen in 
Fig. 2, there are some small areas within the central parts of 
the galaxy cluster that contain large fractional uncertainty, 
30-40% (where contour lines are green and the density of 
lines is greater), however most of the galaxy cluster is mini¬ 
mal in fractional uncertainty, usually below 15% and in some 
regions below 5% (where contours are thin and the density 
of lines is lower), because Grale is very well constrained by 
the images in this region. Large fractional uncertainty val¬ 
ues can be seen outside the central elongated region of the 
cluster where Grale does not have many constraints. 

Grale, or any other lens reconstruction method that 
does not explicitly place mass at the location of visible galax¬ 
ies as part of its input is the right tool for investigating how 
well mass follows light on the scale of galaxies in the clus¬ 
ter. No other published lensing inversion of MACSJ0416 is 
form-free in this sense. Here, we investigate this using two 
approaches, described in the next subsections. 

4.1 Mass Contours and Local Mass Peak Offsets 

One way to determine if light traces mass is to analyze the 
projected mass contours, shown in Fig. 1, and their relation 
to the galaxies. 

It is apparent that the overall mass distribution of 
the cluster, including its elongation is well reproduced. 
Grale finds two prominent cluster-wide mass clumps; these 
correspond to the two parametric dark matter components 
used in most of the models of MACSJ0416 described in Sec¬ 
tion 2. Visual inspection shows that close to the center of 
the cluster, where the map is best constrained the mass den¬ 
sity contours around galaxies encircle the galaxies indicat¬ 
ing that Grale places mass concentration at those locations, 
even though galaxies are not part of the input. Towards 
the edges of the cluster, where the mass is least constrained, 
some contour lines go through galaxies, indicating that these 
galaxies are not ‘detected’ by Grale. 

One of our goals is to determine how well mass fol¬ 
lows light in the immediate vicinity of galaxies. These lat¬ 
ter galaxies that Grale does not detect cannot be used for 
the purpose, but the former ones can. There are four such 
galaxies along the central ridge line of MACSJ0416; they are 
marked in Fig. 1 and have magenta labels G1-G4. These re¬ 
gions also correspond to low fractional uncertainties, < 10%, 
in reconstructed mass. 

A closer inspection of the mass contours around these 
four galaxies reveals that the peaks of the mass distribu¬ 
tion are displaced from the closest galaxy cluster member. 
To find out if the displacement is statistically significant we 


Table 2. Offsets between galaxies G1-G4 and 30 mass models 
peaks in arcseconds. The third column is the root-mean-square 
dispersion between the galaxies and mass peaks. The last column 
is the significance, ratio between the distance and root-mean- 
square. 


Galaxy 

Dist. (") 

RMS (") 

Dist. 

RMS 

G1 

2.77 

3.62 

0.77 

G2 

0.94 

2.16 

0.43 

G3 

5.94 

3.62 

1.64 

G4 

2.20 

2.35 

0.94 


can look at the scatter of the positions of the local den¬ 
sity peaks in the 30 individual mass maps. The green dots 
in Fig. 1 mark the highest single mass pixel in each of the 
30 reconstructions, in circular regions of radius 8" around 
the four galaxies. Even though the galaxy positions are not 
centered on the corresponding cloud of 30 mass peaks, they 
are also not significantly offset from that distribution. Ta¬ 
ble 2 shows the distance in arcseconds between galaxy and 
average position of the 30 mass peaks. The significance can 
be defined as the ratio between the offset and root-mean- 
square dispersion. The last column of Table 2 shows none of 
the galaxies to be significantly offset from the mass peaks. 
We conclude that there is no compelling evidence that the 
mass in the immediate vicinity of the central galaxies does 
not trace the light. This is in contrast to the case of galaxy 
N1 in ACO 3827 (Williams & Saha 2011; Mohammed et 
al. 2014; Massey et al. 2015), where the displacement be¬ 
tween the galaxy and the nearest mass peak is 0.891 q' 2 t "•> 
or 1.62 ±o .49 kpc. 

The width of the distribution of the 30 local mass peaks 
from individual reconstructions, i.e. our uncertainty, is ~5 ,/ , 
and represents the smallest offset we could have detected, 
if these were present. Given that the galaxy-mass offset in 
ACO 3827, ~0.9 ,/ , yields dark matter self-interaction cross 
section that is approximately the same as the upper limit 
from other studies (Clowe et al. 2006; Randall et al. 2008; 
Kahlhoefer et al. 2014, 2015), means that the level of uncer¬ 
tainty in MACSJ0416 will not allow a more stringent upper 
limit on dark matter self-interaction cross section. Although 
the total number of images in MACSJ0416 is high, wl50, it 
appears that their distribution, for example the proximity to 
the four galaxies, or the accuracy of the source redshifts, are 
not adequate to constrain the offsets at a level comparable to 
those seen in ACO 3827. The role of image (or image knot) 
number density in accurately constraining mass maps was 
discussed in Liesenborgs et al. (2008). The authors showed 
that the monopole degeneracy, a way to redistribute mass 
between images by adding circularly symmetric density dis¬ 
tributions with zero total mass, does not change image posi¬ 
tions, and is largely responsible for uncertainty in the mass 
determination. The prevalence of monopole degeneracy will 
depend on whether the distribution of images allows adding 
such circular regions. It is possible that this is easier to do 
near the galaxies G1-G4 in MACSJ0416 than in ACO 3827. 
It is possible that in other HFF clusters the configuration of 
the images is more fortuitous for the detection of possible 
offsets. 
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Figure 1. Mass contours of averaged mass map of MACSJ0416 red overlaid on a HST F435W image. The blue lines represent the mass 
density contours and the red circles are images. There are 20 mass contour levels linearly spaced from 0.03827 g/cm 2 up to 0.76531 
g/cm 2 . red The green dots represent local mass peaks around the 4 central ridge galaxies, in 30 individual mass reconstructions. These 
galaxies are labelled G1-G4, with G1 appearing the brightest. 


4.2 Mass-Galaxy Correlation Function: Grale 

A further measure of how well mass follows light is pro¬ 
vided by the correlation function, £(0) between galaxies and 
the average reconstructed mass map. The projected galaxy- 
mass correlation function describes how galaxies and mass 
are clustered, as a function of separation, 8, on the sky. It 
is defined through conditional probability, dP of finding a 
galaxy in a volume dV, a distance 8 away from another 
galaxy, dP = n( 1 + £)dV , where n is the average number 
density of galaxies. It typically decreases with separation, 
after attaining the largest amplitude at zero. We use the 
estimator £(0) = — 1, where D m Dg represents the 

number of mass pixel-galaxy pairs, and ( D m R g ) is the av¬ 
erage number of pairs of 100 trials, where the positions of 
the galaxies have been randomized. 

The galaxy-mass correlation function between the 
galaxies in the Subaru R-band catalog in MACSJ0416 and 
the average mass map is shown in Fig. 3. It was computed 
using a bin size of 0.52"within a region of area 5903.52 
squared arcseconds and enclosing galaxies and images along 


the cluster’s line of elongation. This is the line extending 
through the two brightest cluster galaxies of the cluster. We 
chose Subaru galaxies over HST galaxies because Subaru 
catalog contains magnitudes for all HST galaxies, including 
those that are too bright for HST. Furthermore, Subaru fil¬ 
ters we use are closely matched by HST filters; specifically, 
galaxy magnitudes in Subaru Z (R) band are tightly corre¬ 
lated with HST F814W (F606W) magnitudes. Since Subaru 
is a ground-based telescope, the light profiles of the bright¬ 
est galaxies are extended, and may block fainter galaxies. 
To reduce this bias, we masked the 10 brightest galaxies in 
the Subaru R-band with a circle of radius 8” before calcu¬ 
lating the correlation functions. Fig. 3 shows £(#) for several 
galaxy magnitude cuts; the number of galaxies in each cut 
is indicated in the figure. 

If galaxies traced the underlying mass distribution ac¬ 
cording to the standard biasing scheme one would expect 
that the brightest galaxies would show the strongest clus¬ 
tering with the mass, and as the galaxy magnitude limit 
is pushed towards lower fluxes, the amplitude of the corre- 
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Figure 2. Contours of fractional uncertainty in mass density of MACSJ0416. The contours are separated into two groups distinguishable 
by their colors. The six blue contour lines range from 3.3% to 19.8% in fractional mass density uncertainty, and the seven green lines 
range from 23.1% to 49.5%. Images are highlighted by red circles and the two BCGs are marked as magenta crosses. 


lation function would decrease due to two reasons: fainter 
galaxies contain less mass, and are thus biased towards less 
mass. This is indeed what is seen. Because Grale input does 
not include any information about the galaxies, the trend of 
brighter galaxies being more biased towards mass is an im¬ 
portant confirmation of the standard biasing scenario. 

Fig. 3 also shows that at very small separations, < 2", 
the correlation function decreases, instead of increasing as 
separation approaches zero. Taken at face value, this would 
be indicative of an offset between dark and light matter in 
cluster galaxies. However, the uncertainties in £, which are 
approximately 0.025, are too large to make that conclusion. 
This behaviour at small separations and its marginal statis¬ 
tical significance was already seen in Section 4.1. 

While it is safe to assume that bright galaxies are mostly 
cluster members, the same cannot be said of fainter ones. 
One way to find the apparent magnitude below which most 
galaxies are background to the cluster is to use lensing mag¬ 
nification bias. Behind a galaxy cluster lensed galaxies are 
made brighter than unlensed ones at the same redshifts and 
the area behind the lens is simultaneously stretched by the 


same magnification factor. To predict the net effect of this 
bias we can look at the differential galaxy counts as a func¬ 
tion of apparent magnitude. Fig. 4 shows these counts based 
on the entire Subaru field; using just the galaxies in the 
direction of the cluster would produce very noisy counts. 
Magnification bias decreases galaxy counts at magnitudes 
where the slope of the counts is shallow, and increases them 
at magnitudes where the slope is steep. When the slope is 
equal to one, dlog(n[m])/dlog(/) = 1, there is no magnifi¬ 
cation bias because the flux magnification and area dilution 
cancel each other out. In principle, one should look at the 
unlensed counts slope for this purpose, which are unobserv¬ 
able, but in the case of relatively shallow counts, the lensed 
observed counts provide a reasonable approximation. 

Because at magnitudes where dlog(n[m])/dlog(/) < 1 
area dilution wins over flux magnification, magnification bias 
predicts anti-correlation between cluster mass and back¬ 
ground galaxies. According to Fig. 4, dlog(n[m])/dlog(/) 
becomes shallower than 1 below m « 20, for both R and Z 
magnitudes. We select two magnitude cuts, 20^m^24 and 
24^25.5; we did not use galaxies fainter than 25.5 be- 
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Figure 3. Normalized galaxy-mass cross-correlation function for 
Subaru R-band at multiple magnitude cuts. The mass map is the 
average of 30 individual Grale reconstructions. Rms dispersion 
between the 100 realizations of D m R g is approximately 0.025 for 
all magnitude ranges. Shaded regions indicate la error bars. 


cause beyond this magnitude counts suffer significantly from 
incompleteness. 

For the 20 ^ m ^ 24 cut we expect modest anti¬ 
correlation for the R-band, and somewhat larger anti¬ 
correlation for the Z-band, because Z-band counts are shal¬ 
lower than those in R-band. The same masking procedure 
was applied to the Z-band for computing the correlation 
function. The computed correlation functions for both bands 
are shown in violet in the lower middle left and right pan¬ 
els of Fig. 5. Indeed, the Z-band galaxies show less correla¬ 
tion than the R-band galaxies. However, neither the R-band 
selected galaxies nor the Z-band selected galaxies show an 
anti-correlation between Grale cluster mass and galaxies. 
This might indicate that some of the galaxies in this magni¬ 
tude range are cluster members, and so the total correlation 
function signal consists of a superposition of mass clustering 
with the cluster member galaxies, and magnification bias 
resulting from the cluster mass lensing background galaxies. 

At fainter galaxy magnitudes, 24 25.5, we expect 

that most, if not all galaxies are background to the cluster, 
and so magnification bias will be the only effect. Consis¬ 
tent with this expectation we find a strong anti-correlation 
between cluster mass and cluster galaxies for both Subaru 
magnitude bands (we show only R band results here). Thus, 
different magnitude cuts show features that imply that mag¬ 
nification bias exists in MACSJ0416, and so many of the 
fainter galaxies are significantly background to the cluster. 
Magnification bias is sometimes used to aid mass reconstruc¬ 
tion in clusters (Umetsu & Broadhurst 2008; Umetsu et al. 
2015). 

We note that the (anti-)correlations extend to about 
20", or 100 kpc at the redshift of the cluster (Fig. 3), which is 



Figure 4. Differential number counts of galaxies over the entire 
Subaru field, not just the galaxies behind the cluster. The vertical 
dashed line marks the magnitude below which counts are incom¬ 
plete; we did not include these galaxies in any of the analysis. The 
marked line, d log (n [ rn ]) /4 ]og(/) = 1, is provided for comparison 
to the two bands. 

roughly the typical separation between bright, m < 20 galax¬ 
ies, and considerably smaller than the size of the cluster, 
120"x50". 


4.3 Mass-Galaxy Correlation Function: Grale vs. 

Lenstool 

A number of metrics can be used to compare mass recon¬ 
structions from different lens inversion methods. One can 
look at the differences, or fractional differences between two 
k maps on the x, y plane, however, summarising that in¬ 
formation in a concise way is difficult. One can also look at 
the circularly averaged radial density profiles of the different 
maps, however, this entails a lot of averaging, which hides 
many potentially interesting differences. We use the correla¬ 
tion function, which is similar to circular averaging, but is 
done around each galaxy, instead of just the center of the 
cluster. It is a good compromise between too much and too 
little detail. 

We compare Grale results to those of two groups that 
use different implementations of Lenstool, CATS team 
and Sharon/Johnson team, which are presented in Jauzac 
et al. (2015) and Johnson et al. (2014), respectively, and are 
available for download on the HST MAST website. In this 
work, we limit our comparison to reconstructions based on 
Lenstool only, because Grale and Lenstool are at the op¬ 
posite ends of the spectrum of assumptions going into lens 
inversion methods. We leave a broader comparison to a later 
work. The CATS reconstruction is based on the HFF strong 
and weak lensing data, and Sharon/Johnson reconstruction 
uses pre-HFF strong lensing data. Both assume flat ACDM 
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Figure 5. Galaxy-mass cross-correlation functions for five Subaru R-band and one Z-band magnitude cut. Three mass maps are used: 
from the Grale team (violet), CATS team (gold), and Sharon/Johnson (red). Shaded regions indicate lcr error bars. 


cosmology, with = 0.3, and h = 0.7. This is different 
from our assumed cosmology, but the difference results in 
negligible deviations of length scales, ~ 0.5%. 

Figure 5 plots mass-galaxy correlation function for six 
different galaxy magnitude cuts. All but the bottom mid¬ 
dle panel use Subaru R-band to select galaxies; the bottom 
middle panel uses Z-band. The magnitude cuts in the six 
panels are mu $( 20, mu ^ 22, mu ^ 24, 20 $( mu St 24, 
20 ^ mz ^ 24, and 24 ^ mu 25.5. A cursory look at these 
reveals that all three teams recover very similar clustering 


properties. The fact that parametric and free-form methods 
yield very similar results is encouraging, and leads to two 
conclusions, (a) strong priors about mass following light are 
not required to recover mass distribution in clusters with 
very many lensed images, (b) light follows mass quite well 
in this merging cluster. 

Though Grale and the two implementations of 
Lenstool agree overall quite well, there are notable dif¬ 
ferences between the three. 

CATS and Sharon/Johnson correlation functions have 
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a sharp spike near zero separations. This tight correlation 
is the result of Lenstool placing mass at the locations of 
the visible galaxies. The central spike in Lenstool models 
can be also seen in the projected mass power spectrum of the 
cluster studied by Mohammed et al. (2016): see the left panel 
of their Fig. 9, where the fluctuation power at large k does 
not fall as fast as Grale’s. Grale, which is blind to galaxies, 
still detects galaxies as gauged by this metric, but does not 
associate as much mass with them due to a combination of 
two reasons, (i) the positions of lensed images do not require 
it, and (ii) Grale does not have sufficient spatial resolution. 

Aside from the central spike, the correlation func¬ 
tions of the two implementations of Lenstool, CATS and 
Sharon/Johnson, disagree with each other at the same level 
as they disagree with Grale. In fact, in the range 9 = 
2"—20” CATS and Grale are closer to each other than CATS 
and Sharon/Johnson. This behaviour is probably the result 
of the data sets used: Grale and CATS use HFF, while 
Sharon/Johnson use pre-HFF. 

Given that CATS and Grale teams use the same strong 
lensing data sets, the difference between these two should be 
entirely due to the differences in the reconstruction method. 
A closer examination shows that at separations 2" — 10” and 
magnitude cuts above mu = 24, CATS team correlation 
functions’ amplitude is below that of Grale —one can say it 
shows a dip—and below a smooth extrapolation of £(#) from 
larger separations. On these scales Grale’s £(#) looks smooth 
and rises monotonically in that region. In other words, com¬ 
pared to Grale, CATS Lenstool mass maps show a steeper 
decline in correlation function with increasing separation. 
This is likely a consequence of how Lenstool builds its 
mass maps as a superposition of large, smooth dark matter 
component(s) and mass spikes due to galaxies. There is no 
intermediate mass component that could bridge the gap and 
introduce more mass on these length scales around galaxies. 

Grale’s £(#) gentler decline with 9 on scales 2” —10” or 
10 — 50 kpc may suggest that there is more galaxy-mass 
correlation on these scales, i.e. that galaxies, and especially 
those in the magnitude range 22 < mu 24, are more bi¬ 
ased traces of mass in clusters than Lenstool assumes. 
In this context it is interesting to recall a recent discovery 
of nearly a thousand low surface brightness galaxies with 
1 kpc < R e < 3 kpc in the center of Coma cluster (Koda 
et al. 2015). Since these galaxies are old and evolved, they 
likely existed in clusters at higher redshifts, comparable to 
those of HFF clusters. Though faint and likely low mass, 
they still outnumber cluster members. These are not part 
of Lenstool input, and if they cluster with brighter galax¬ 
ies, they could contribute to higher correlation amplitude on 
scales of tens of kpc. 

In general, the somewhat different galaxy-mass cluster¬ 
ing amplitudes found by the three groups imply distinct 
models of how mass is distributed in clusters, and have differ¬ 
ent implications for the cluster structure and evolution. For 
example, a larger galaxy-mass clustering amplitude, such as 
obtained by Lenstool vs Grale, may imply that galaxies 
form at much higher density peaks of the matter distribu¬ 
tion, or that only the most compact galaxies can survive in 
cluster centers. Hydrodynamic numerical simulations could 
help to further determine the implications of the degree of 
biasing within clusters. 


5 CONCLUSIONS 

We carried out a lens inversion of MACSJ0416 using Grale, a 
free-form genetic algorithm based method. The only inputs 
were 149 lensed images, identified by Jauzac et al. (2014) 
based on HFF data. First, we summarize our results on the 
comparison between Grale and the two Lenstool mass re¬ 
constructions of this massive, merging 2 ~ 0.4 cluster. Be¬ 
cause the true mass distribution in galaxy clusters is un¬ 
knowable, it is important to critically compare the com¬ 
monly used estimators. Our conclusions based on these two 
very different methodologies are: 

• Though the mass maps of Grale and Lenstool re¬ 
constructions look somewhat different in detail, Grale and 
Lenstool (specifically, CATS and Sharon/Johnson teams) 
recover very similar statistics of the mass-galaxy correlation. 

• The most striking, but expected difference is that 
Lenstool’s galaxy-mass correlations show a pronounced 
spike near zero separation. This is because Lenstool places 
a lot of mass at the locations of galaxies as part of its input, 
whereas Grale does not. 

• Another notable difference is that on scales of 2" —10”, 
or tens of kpc, Grale’s correlation function falls less steeply 
than Lenstool’s. It is possible that the lensing constraints 
are compatible with both the steeper and the shallower de¬ 
cline. However, it is also possible that the mass distribu¬ 
tion is more extended on these scales around galaxies, as 
Grale suggests, due to the presence of many hundreds of 
low surface brightness galaxies of the kind recently detected 
in Coma (Koda et al. 2015). 


The fact that two very different methodologies— 
Grale and Lenstool —give similar results leads us to con¬ 
clude that when lensed image number is around 100 or more, 
the images alone are sufficient to recover the mass distribu¬ 
tion in clusters very well. Strong priors on galaxies are not 
needed. 

Our conclusions regarding the mass vs. fight distribu¬ 
tion in MACSJ0416 are: 


• Even with the high number of images present in 
MACSJ0416, no significant mass-light offsets are found be¬ 
tween the four central galaxies and the nearest mass peaks, 
in contrast to ACO 3827. Our uncertainties are ~ 5”, larger 
than 0.9” offset observed in ACO 3827. It is possible that 
other HFF clusters have a more fortuitous image configura¬ 
tion and hence smaller uncertainties around the bright clus¬ 
ter galaxies. 

• Overall, on scales larger than a few arcseconds, light 
traces mass, as reconstructed by Grale, in the merging clus¬ 
ter MACSJ0416 quite well, as measured by the galaxy-mass 
correlation function. This is the only analysis of this merging 
cluster that does not use any information about the visible 
fight, hence the conclusion that fight follows mass is not 
trivial. 

• The faintest galaxies in the direction of MACSJ0416 are 
anti-correlated with the cluster mass, implying the presence 
of the lensing magnification bias 
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